tt a tt St Ht tt HH HH HH Ha 
tt a tt a a tt i 


T 
T 
T 
T 
T 
T 
T 
T 
T 
T 
T 
T 
T 
T 
T 
T 
T 
T 
T 
T 
T 


HHH HHH 
HHH HHH 
HHH HHH 
HHH HHH 
HHH HHH 
HHH HHH 
HHH HHH 
HHH HHH 
HH HHH 
HHHHHHHHHHHHHHH 
HHHHHHHHHHHHHHH 
HHHHHHHHHHHHHHH 
HHH HHH 
HHH HHH 
HHH HHH 
HHH HHH 
HHH HHH 
HHH HHH 
HHH HHH 
HHH HHH 
HHH HHH 


tt tt tt st 4 Ht Ht Ht Ht Ht Ht HH HH 
AAA a 
| LE) 


LLLLLLLLLLLLLLL 
LLLLLLLLLLLLLLL 
LLLLLLLLLLLLLLL 


*eF 1LE**1D**MTHSQRT 


LLLLLLLLLL 
LLLLLLLLLL 


444 
a4 
44 


AAA at St SH 
AAAs 


akan 
ee 
Dt Pn Bt Be ae ne hes es Pe Pe ee 
to ee 
bat 


Pat ba me bene Pee Gans Pet Os bd Bg Gene Bs Ee 


oa 
es be 
at hee 
a 


4 


= 


— 
a4 
a4 
a4 


AAA tt St tt 
pn ee ee eee 


MTHSSQRT 
Table of contents 


sg 


bp we 


15 
; Floating Point Square Root renive 16-SEP-1984 01:51:08 VAX/VMS Macro Vv04-00 


eistesy petetted Nay pease: Edit Histor 


DECLARA ations 
H$SQ BAY 
MIHSSORT _R3 


Stang 


Lorarive Part of hodu 
any ngle recision Float 
B SQRT routine 


a SQRT 


Page 


0 


ae 


rae ; Floating Point Square Root me A Tense b=18 : 91: 3}: 98 oA Macro V04-00 Page 1 


- MTHRTL.SRC JMTHSQRT MAR; 1 (1) 
TITLE MTHSSQRT g Floating Point Square Root routine 
IDENT /1-015/ ; File: MTHSQRT.MAR EDIT RNH1015 


| eeuaeeneenseneunsneenceesonenonenenseeteneeeeeebenenensenasoeqnonneeoooocnes 


COPYRIGHT (c) 1978, 1980, 1982, 1984 BY 
DIGITAL EQUIPMENT CORPORATION, MAYNARD, MASSACHUSETTS. 
ALL RIGHTS RESERVED. 


*® ® 
*® e 
°® ® 
°® ® 
* ® 
s® THIS SOFTWARE IS FURNISHED UNDER A LICENSE AND MAY BE USED AND COPIED * 
* ONLY IN ACCORDANCE WITH THE TERMS OF SUCH LICENSE AND WITH THE * 
:@ INCLUSION OF THE ABOVE COPYRIGHT NOTICE. THIS SOFTWARE OR ANY OTHER * 
:® COPIES THEREOF MAY NOT BE PROVIDED OR OTHERWISE MADE AVAILABLE TO ANY * 
:# OTHER PERSON. NO TITLE TO AND OWNERSHIP OF THE SOFTWARE IS HEREBY * 
:# TRANSFERRED. * 
bad ® 
* THE INFORMATION IN THIS SOFTWARE IS SUBJECT TO CHANGE WITHOUT NOTICE 
;# AND SHOULD NOT BE CONSTRUED AS A COMMITMENT BY DIGITAL EQUIPMENT * 
:* CORPORATION. * 
°® ® 
ca ® 
* ® 
°® ® 
*® * 
*® ® 


DIGITAL ASSUMES NO RESPONSIBILITY FOR THE USE OR RELIABILITY OF ITS 
SOFTWARE ON EQUIPMENT WHICH IS NOT SUPPLIED BY DIGITAL. 
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; ABSTRACT: 


; MTH$SQRT is a function which returns the floating point square root 
; p oF its sin ~ _Srestaten floating point argument. The call is standard 
call- my nce. 
; MTH$SQ TRS. is a special routine which is the same as MTH$SQRT except 
; a faster non-standard JSB call is used with the argument in RO and no 
; registers are saved. 


; ; VERSION: 01 


; HISTORY: 
; AUTHOR: 
Peter Yuo, 15-Oct-76: Version 01 


; MODIFIED BY: 


3 A Peter Yuo, g2-ney=77 


1-2 Peter Yuo, 31-May-77 
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i FACILITY: MATH LIBRARY 
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MTHSSQRT 3; Floating Point Squ 1984 01:51: AX/VMS Macro V04-00 Page 
- ; Detailed Current t History -SEP- 227: MTHRTL.SRCIMTHSQRT.MAR; (2) 
More AISTORY s-Detailed tc Edit Hi Boeri bbe 91:35:95 PANANTS Bacts Te tetas tl 
f -SBTTL HISTORY ; Detailed Current Edit History 
8 3; ALGORITHMIC DIFFERENCES FROM FP=11/C ROUTINE: none 
3 t History for Version ° H 
Edit Hi f ion 01 of MTHS$SQRT 
8 : 917) Code saying after code review 
65 ; Ol-2 ROTL shift in garbage into highest bit. Use oo instead. 
B8 64 ; ADDL neteuction after ADJUST - een changed. ate ADDW to prevent 
0 65 ; oyer flow if R1<31: ph FFFF * Rd 1:1 
88 96 3 §1-3 Finish error handl am rig pd 
3 01-5 MTHSSERROR chan > g te Arh Sfouan 
44 o8 3 at eee Chan nge MTH_ 
000 8 ; nged error handling mechanism. Put error result in RO before 
44 2 3 a) ng MTHSSSIGNAL in order to o allow us user _modify error result. 
000 3 1-6 Return -0.0 on negative arg. -77 
000 i 3 Q1-7 Edit in Rich Lary’s code ae, JSB rou tine is now _R3. JMT 19-Jan-78 
000 73 ; 01-9 Move .ENTRY symbol to module header. TNH 14-Aug-78" 
0000 74 ; 1-010 = Put version number >. Stencere vecuees three ignt edit 
0000 75; numbers. Also one Mat Af notice. S 16-NOV-78 
0000 6 3 1-011 - change, ATH. SQuRO0 5° to CaTHSe UROONEG 38 S 07-DEC-78 
0000 77; pe 3 - o the P TeSirestive... 
0000 78 ; 1-3) ~ Dec lave po B yg May-19 
0000 79 ; 1-014 = Move MTHS$SQRT_R2 to gai Roe Sau module (MTHSQRTR2.MAR) and 
0000 80 ; replace with atiss eps 
3 I= = Changed W* to n call to MTH AL RNH 09- ept- 
0000 81 ; 1-015 = Ch d G* in all ' $$S1G RNH 09-S 1981 
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ATIONS ; Declarative Part of Modul $ 


INCLUDE FILES: 


EXTERNAL SYMBOLS: 


-DSABL GBL 
~EXTRN MTHSK_SQUROONEG 
-EXTRN MTHSSSIGNAL 


; EQUATED SYMBOLS: 


3 MACROS: none 
; PSECT DECLARATIONS: 


SDOOOOCSOeSSOOOOSOSSoSoOS 


: OWN STORAGE: none 
> CONSTANTS: 


3; Constants A and B chosen for k = odd 


MOOD NAMNE WIN OOD NAME WN 9 OD NAUE WN OOONAUE 
eee 


SOSOSSSSSCSOSSSOSOOOCS OOOO 
SSSSSSSSSSSOSSSSSSSSS 

— ed ot td 2 ws 8 

RR SS | 2 QOD DOOD 0000 00000000 


OGoOoQooooooooooooo 


SOooooooooooeo 


LF _ODD_A_E63 = *KX13CDSFD4 
LF_ODD_B_EM63 = *x%3C4A2018 
; Constants A and B chosen for k = even 
LF_EVEN_A = *XF61A4015 
LF_EVEN_B_EM64 = *X4B231FD7 


AX/VMS Macro v04-00 
MTHRTL.SRCIMTHSORT MAR; 1 


-SBTTL DECLARATIONS ; Declarative Part of Module 


ACMASK = “M<IV, 82, R3> 3 register save mask and IV enable 


~-PSECT _MTHSCODE PIC,SHR,LONG, EXE ,NOWRT 
3; program section for math routines 
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ndard Single precision FL g-SEp=19 4 1:39:98 EMTHRTL SREIMTHSORT. MAR: 1 ws (4) 


-SBTTL MTHSSQRT = Standard Single Precision Floating SORT 


frhssa 


: FUNCTIONAL DESCRIPTION: 

: SQRT = single precision floating point function 

: SQRT(X) is computed using the following approximation technique: 
If X <= 0, error. Let X = IX}. 
Let X = 2e*K * F where F is the fractional part. 


If K = even, X = 2e*(2P) * F 
SQRT(X) = 2eeP ® SORT(F), 1/2 =< F< 1 


odd, X = 2e(2P41) * F = 2e8(2P+2) ® (F/2) 
SQRT(X) = 2ee(P41) ® SQRT(F/2), 1/4 =< F/2 < 1/2. 


Let F° = AtF + B, 


OOOOCCGOOCOOOCOOOOCOoSoOoO -~—- 
SOOCCCOCCOOCOCOOCOOCOOCOOOOOOOOOOOOoOOO 


or 
une un 


or 


and 
K' Ps. for K = 
P +14 for K = 
} 
Let YO = 2**k' * F* as a staight line perestaat ies wthin the 
given interval using coefficients A and B which minimize the 
e midpoint and endpoint. 


OoOoooooooooooo 


Sss 


a a a a kk dd dd ddd dd dd es UO 
SNS SSD DDD DEDEDE DED. De TTT BE BE BB BB ANIA NII AIUPIPNIPUNINPYD st? 
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absolute error at t 
Starting with YO, two Newton-Raphson iterations are performed. 
YCn+1) = (1/2) * ( YEn] + xX/¥En]) 
; The relative error is < 10**-8, 
; CALLING SEQUENCE: 
sart.wf.v = MTH$SOQRT(x.rf.r) 
INPUT PARAMETERS: 


COoooeo 


: = At(F/2) + B 


: 


00000004 LONG = 4 ; define Longwarg multiplier 
00000004 x = 1 * LONG 3; Contents of x is the argument 
IMPLICIT INPUTS: none 


OUTPUT PARAMETERS: 
VALUE: floating square root of the argument 
IMPLICIT OUTPUTS: none 
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"s Standard Single Precision Fl raat 7 94:35:98 ERTHRTL. SREIMTHSORT. MAR; 1 - (4) 


COMPLETION CODES: none 
SIDE EFFECTS: 


Signals: MTH$_SQUROONEG if X < bg with reserved eert and in RO (copied to 
the signal mechanism vector CHFSL_MCH_RO/R1 by LIBSSIGNAL). 

Associated message is: "SQUARE ROOT OF NEGATIVE VALUE'’. Result is reserved 
operand -0.0 unless a user supplied (or any) error handler changes CHFS$L_MCH_RO/R1 


NOTE: This procedure disables floating point underflow, enables integer 
overflow, causes no floating overflow or other arithmetic traps, and 
preserves enables across the call. 


ENTRY MTH$SQRT, ACMASK 3 standard call-by-reference entry 
: disable DV (and FU), enable IV 
MTHSFLAG_ JACKET ; flag that this is a jacket procedure in 


MOVAB G*MTHSSJACKET_HND, (FP) 
set handler address to jacket 
handler 


case of an error in special routine 
RO = arg 

call specail SQRT rountine 

return - result in RO 


MOVF  ax(AP), RO 
BSBB MTHSSQRT_R3 


MTHSSQRT ; Floatin Point Square Root ro alae” 16-SEP-19 1: AX/VMS Macro V04-00 Pa 
ators ATHSSORT R3 = JSB SORT routine. ~$een13be 91:34:98 PAMANTS aese NOGRsoman:1 28 8, 
] 5 -SBTTL MTHSSQRT_R3 = JSB SORT routine 
i : : JSB SQRT = used by the standard, and directly. 
4 § : CALLING SEQUENCE: 
1 10 ; save anything in RO:R2 
1 11 3 R : input in RO 
1 12; JSB MTH$saRT_R3 
} 17 3 return with root in RO 
901 15 ; Note: This routine is written to goers ony integer overflows, floating overflows, | 
4 1§ ; floating underflows or divide by 0 conditions, whether enabled or not. 
or 18 : REGISTERS USED: 
1 17 3 RO = Floating argument then result 
01 0; RI - X saved for use during iteration 
id 13 R2 = scratch 
Bat 5 MTHSSQRT_R3:: 3; JSB routine for SQRT 
51 39 50 001 4 MOVE RO, R1 ; test sign of X and save it in R1. 
5315 0013 3 BLEQ ZERO_NEG > branch to FERO. NEG if X =< 0 | 
$018 7:x>0 | 
0015 8; | 
0015 9 Pos: 
52 50 3¢ 0015 0 MOVZWL RO, R2 : isolate low 16 bits S iste oup.Yiceet in R 
2§ 94 0018 31 CLRB 3; R2 now has “- and ? exp bits 
a AA dts 3 BICW R2, RO 3; clear sign and left 7 exp bits 
50 95 001D 3 TSTB RO 3; check low bit of exp 
10 18 OO1F 34 BGEQ EVEN 3; and branch if 1 
50 13CD5FD4 8F 44 0021 35 MULF #LF_ODD_A_E63, RO 3 add 64 Chat of bias) to (exponent-2) 
0028 36 : oad start approximation cals 
50 3€4A2018 8F 40 0028 237 ADDF  § #LF_ODD_B_EM63, RO : RO = (first approx) * 2**-64 
13. 11 O02F 38 ARB ADJOST 3 go adjust 
0031 39 
0031 240 EVEN: 
50 2000 8F Ad O08) 41 ADDW #*x2000, RO ; exp is 0 - make it 64 (2**-64) for Legalit | 
50 ~F61A4015 BF 44 0036 4g MULF § #LF_EVEN_A, RO | 
50 4B231FD7 8F 40 Boze re sgmust ADDF #LF_EVEN_B_EM64, RO ; RO = (first approx) * 2%#-64 
S52 52 IF 9C 0044 245 “ ROTL #31, R2, R2 : divide R2 (expsbias) by 2, 
0048 46 : Giving (exp/2+64) 
50 52 a0 0048 47 ADDW R2, RO : insert exp/2 in first approx and 
Ree +8 3; re-bias it. 
984 29 : first iteration - single precision is sufficient 
$2. 51 59 47 0048 3g , DIVES RQ, Ri. R2 ; R2 = 
50 § 40 O04F 5 ADDF ; RO = YO + ¥(v9 
50 0080 8F A2 038 54 SUBW Recah” RO : RO = Y1 = (1/2)¢Y¥0 + X/YO) 
B25 2? 3: no overflow possible 
57 25 : second iteration, do in double precision to get truncated( rather than 
2f a8 : 3; rounded) result. 
U4 60 ::: CLRL ag ; lower part (x) = 0. 
57 el 323 DIVD RO, R1 : divide Y1 into X with low-order 
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pat hrnssanT 83° jsB Sart routine ae et 9 O14: 98 YOTARTL SRC JMTHSQRT.MAR; 1 ° ($). 
¢ 3 3 bits of Y1 garbage. This doesn't 
7 3; effect accuracy, since Y1 innacurate 
ia 
52 1 56 7 65 CVTFD » R2 3 convert and copy X into R2/R3 
1 04 A 96 CLRL ; clear low part of Y1 
26 § 6 2¢ 6 Divbd2 4. Re :; divide niche x 
5 4 F 68 ADOF ; RO = Y1 + par rt(x/v1) 
50 0080 8F A 6 4 SUBW #*x80, RO ; RO = vont? -~ i >» (¥1 + X/Y1) 
0 ? i SQRTX: RSB : return, RO = result 
6 ie 7 x =< 0 
4 74 LERO_NEG: 
FD 0s 13 6 75 BEQL SQRTX 3 return with RO = result = 0 
§ DD 006A i PUSHL (SP) 3: return PC from JSB routine 
—E OO°8F 9A 895 7 MOVZBL #MTHSK_SQUROONEG, -(SP) ; condition value 
50 0 78 78 ASHL #15, #T, RO : Rd 2 aceeult = reserved operand -0.0 
074 79 : es to signal mechanism vector 
0074 80 : Rey $L MCH_RO/R1) so error handler 
0074 81 3 con modify the result. 
00000000'GF 02 FB 0074 3 CALLS #2, G*MTHSSSIGNAL 3 dee error and use real user's PC 
0078 8 $ ependent of CALL v 
0S 0078 84 RSB : pt - RO pestered: tren CHFSL_MCH_RO/R1 
007C 285 
007C 286 ~ END 
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Symbol table 3 : B-SEP-1986 1:27:19 EMTMRTL.SREIMTMSORTOMAR;1 29° (By 


ACMASK = 4 Of 
ADJUST . B 4 
EVEN 31 Rk 1 
F_LEVEN = F6IAG i} 
Lt -EVEN,B B EROS = $62 FD 
D.A_E63_ = 13CD5FD4 
tf “ODD “B_EM63 = 3€4A2018 
LON = 00004 
MTHSSJACKET_HND teeeeeee =X 4 
MTHSSSIGNAL” terranes x 0 
ota 50000000 age 89 
CHEST os $9988808 0 
POS 44/64 ES 01 
SQRTX 00 67 R 01 
x = 00000004 
ZERO_NEG 00000068 R 01 
$e er ew ete ee cence + 
} ;_Psect synopsis ! 
PSECT name Allocation PSECT No. Attributes 
ees, 00000000 ( 0.) 00 ¢ 0.) NOPIC USR CON’ ABS LCL — NOEXE NORD NOWRT NOVEC BYTE 
_MTHSCODE 0000007C ( 124.) 01 ¢ 1.) PIC USR CON REL LCL SHR EXE RD NOWRT NOVEC LONG 
pees eocn en se nner ere moe nes 
; Performance indicators H 
Phase Page faults CPU Time Elapsed Time 
Initialization $3 00:00: 8-1 00:00:00.81 
pomnend processing 109 00:00:00.7 00:00: 23. 91 
s 1 82 00:00:00.8 00:00:03.78 
=a bende of HES 98 888-0 
Syabot table output : 00: 0: 0.0 00:00:00.41 
Psect synopsis output 00:00:00. 00:00:00.0 
Cross-reference output 0 00:00: 3-? ie Se Sy 
Assembler run totals 291 00:00:02.4 00:00:13.4 


HY 3 working et Limit was 900 pages. 
082 byies (8 pages) of virtua Cary were used to buffer Fai sueraeeyoee code. 
all were 10 pages of symbol table space allocated to hold non-local and 0 local symbols. 
346 source Lines were read in Pass 1, producing 11 object records in Pass 2 
1 page of virtual memory was used to “define 1 macro. . 


weaowmoarna son seer ese aewesaoae} 


Macro Library name Macros defined 


~-$255$DUA28: CSYSLIBISTARLET .MLB; 2 


TREE acre mun Searianten + Fleeting Point Severe Rot routlne’ —sE-SE5-1884 1:51:09 WMA SHeSRrMGtaPmunss 8% 


0 GETS were required to define 0 macros. 
There were no errors, warnings or information messages. 


MACRO/ENABLE=SUPPRESSION/D1SABLE=(GLOBAL , TRACEBACK) /LIS=LIS$:MTHSQRT/OBJ=O0BJ$:MTHSORT MSRCS:MTHJACKET/UPDATE=(ENHS:MTHJACKET) +MSRCS: | 


ORPORATION 
OPRIETARY 
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